Search CORE

24 research outputs found

An Algorithm to Compute the Character Access Count Distribution for Pattern Matching Algorithms

Author: Marschall T. (Tobias)
Rahmann S. (Sven)
Publication venue: 'MDPI AG'
Publication date: 01/10/2011
Field of study

We propose a framework for the exact probabilistic analysis of window-based pattern matching algorithms, such as Boyer--Moore, Horspool, Backward DAWG Matching, Backward Oracle Matching, and more. In particular, we develop an algorithm that efficiently computes the distribution of a pattern matching algorithm's running time cost (such as the number of text character accesses) for any given pattern in a random text model. Text models range from simple uniform models to higher-order Markov models or hidden Markov models (HMMs). Furthermore, we provide an algorithm to compute the exact distribution of \emph{differences} in running time cost of two pattern matching algorithms. Methodologically, we use extensions of finite automata which we call \emph{deterministic arithmetic automata} (DAAs) and \emph{probabilistic arithmetic automata} (PAAs)~\cite{Marschall2008}. Given an algorithm, a pattern, and a text model, a PAA is constructed from which the sought distributions can be derived using dynamic programming. To our knowledge, this is the first time that substring- or suffix-based pattern matching algorithms are analyzed exactly by computing the whole distribution of running time cost. Experimentally, we compare Horspool's algorithm, Backward DAWG Matching, and Backward Oracle Matching on prototypical patterns of short length and provide statistics on the size of minimal DAAs for these computations

CWI's Institutional Repository

Discovering motifs that induce sequencing errors

Author: Allhoff M.C. (Manuel)
Costa I.G.
Marschall T. (Tobias)
Martin M. (Marcel)
Rahmann S. (Sven)
Schönhuth A. (Alexander)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

CWI's Institutional Repository

Discovering motifs that induce sequencing errors

Author: Allhoff M.C. (Manuel)
Costa I.G.
Marschall T. (Tobias)
Martin M. (Marcel)
Rahmann S. (Sven)
Schönhuth A. (Alexander)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

CWI's Institutional Repository

Reliable transfer of transcriptional gene regulatory networks between taxonomically related organisms

Author: A Paccanaro
AD Gonzalez Perez
AE Kel
AG Perez
AJ Enright
Andreas Tauch
CO Pabo
DJ Galas
I Brune
I Matic
J Baumbach
J Baumbach
J Baumbach
J Baumbach
J Baumbach
J Baumbach
Jan Baumbach
K Brinkrolf
LM Hellman
LV Sun
M Beckstette
M Madan Babu
M Tompa
RL Tatusov
S Balaji
S Balaji
S Rahmann
S Rahmann
SA Teichmann
SF Altschul
Sven Rahmann
T Wittkop
V Espinosa
WB Alkema
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Baumbach J, Rahmann S, Tauch A. Reliable transfer of transcriptional gene regulatory networks between taxonomically related organisms. BMC Systems Biology. 2009;3(1):8.Background: Transcriptional regulation of gene activity is essential for any living organism. Transcription factors therefore recognize specific binding sites within the DNA to regulate the expression of particular target genes. The genome-scale reconstruction of the emerging regulatory networks is important for biotechnology and human medicine but cost-intensive, time-consuming, and impossible to perform for any species separately. By using bioinformatics methods one can partially transfer networks from well-studied model organisms to closely related species. However, the prediction quality is limited by the low level of evolutionary conservation of the transcription factor binding sites, even within organisms of the same genus. Results: Here we present an integrated bioinformatics workflow that assures the reliability of transferred gene regulatory networks. Our approach combines three methods that can be applied on a large-scale: re-assessment of annotated binding sites, subsequent binding site prediction, and homology detection. A gene regulatory interaction is considered to be conserved if (1) the transcription factor, (2) the adjusted binding site, and (3) the target gene are conserved. The power of the approach is demonstrated by transferring gene regulations from the model organism Corynebacterium glutamicum to the human pathogens C. diphtheriae, C. jeikeium, and the biotechnologically relevant C. efficiens. For these three organisms we identified reliable transcriptional regulations for similar to 40% of the common transcription factors, compared to similar to 5% for which knowledge was available before. Conclusion: Our results suggest that trustworthy genome-scale transfer of gene regulatory networks between organisms is feasible in general but still limited by the level of evolutionary conservation

Crossref

Springer - Publisher Connector

PubMed Central

Publications at Bielefeld University

Next-generation RNA sequencing reveals differential expression of MYCN target genes and suggests the mTOR pathway as a promising therapy target in MYCN-amplified neuroblastoma

Author: Barann M.
Büchel G.
Eggert A. (Angelika)
Esser D.
Fielitz K.
Heilmann M.
Köster J. (Johannes)
Marschall T. (Tobias)
Martin M. (Marcel)
Rahmann S. (Sven)
Rosenstiel P.
Schramm A. (Alexander)
Schulte J.H. (Johannes)
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 01/02/2013
Field of study

CWI's Institutional Repository

Epigenetic dynamics of monocyte-to-macrophage differentiation

Author: Andreas S. Richter
Benedikt Brors
Bernhard Horsthemke
Bärbel Felder
Christopher Schröder
Claudia Haak
Corinna Attenberger
Daniela Beißer
Elsa Leitão
Fabian Müller
Filippos Klironomos
Gerd Schmitz
Gideon Zipprich
Gilles Gasparoni
Jörn Walter
Karl Nordström
Laura Arrigoni
Ludger Klein-Hitpass
Matthias Barann
Nikolaus Rajewsky
Peter Ebert
Philip Rosenstiel
Sebastian Fröhler
Stefan Wallner
Sven Rahmann
Tea Berulava
Thomas Lengauer
Thomas Manke
Ulrike Bönisch
Wei Chen
Publication venue: Springer Nature
Publication date: 01/01/2016
Field of study

Background Monocyte-to-macrophage differentiation involves major biochemical and structural changes. In order to elucidate the role of gene regulatory changes during this process, we used high-throughput sequencing to analyze the complete transcriptome and epigenome of human monocytes that were differentiated in vitro by addition of colony-stimulating factor 1 in serum-free medium. Results Numerous mRNAs and miRNAs were significantly up- or down-regulated. More than 100 discrete DNA regions, most often far away from transcription start sites, were rapidly demethylated by the ten eleven translocation enzymes, became nucleosome-free and gained histone marks indicative of active enhancers. These regions were unique for macrophages and associated with genes involved in the regulation of the actin cytoskeleton, phagocytosis and innate immune response. Conclusions In summary, we have discovered a phagocytic gene network that is repressed by DNA methylation in monocytes and rapidly de-repressed after the onset of macrophage differentiation

University of Regensburg Publication Server

Springer - Publisher Connector

High-throughput microarray technology in diagnostics of enterobacteria based on genome-wide probe selection and regression analysis

Author: A Haraga
A Iguchi
A Lehner
A Loy
Angelika Fruth
B Korczak
Baum H von
BW Wren
C Fraley
C Jenkins
C Pelludat
CH Chiu
DA Rasko
DE Fouts
DL Paterson
E Brzuszkiewicz
Eliora Ron
F Yang
Florian Gunzer
FR Blattner
G Bruant
GA Grassl
GN Schroeder
H Giamarellou
H Nie
H Willenbrock
H Willenbrock
HG Park
IT Paulsen
J Hejnova
J Letowski
J Parkhill
J Parkhill
J Sambrook
J Wei
J Zdziarski
JB Kaper
JC Engelmann
JE Larkin
JG Frye
Jörg Hacker
K Ballmer
K Hayashi
K Oshima
K Sen
KE Rodriguez-Siek
L Grozdanov
L Sanchez
LA Wilson
M McClelland
M McClelland
M McClelland
M Pritsker
M Wagner
MM Venkatesan
N Ahmed
N Dorrell
N Kobayashi
NR Thomson
NR Thomson
NT Perna
O Clermont
PE Saebo
PS Chain
PS Chain
Q Jin
R Podschun
RA Welch
RDC Team
RJ Case
Roy R. Chaudhuri
S Bekal
S Kariyawasam
S Porwollik
S Rahmann
S Rahmann
S Rahmann
SE McNamara
SF Altschul
SJ Hinchliffe
SL Chen
Sven Rahmann
T Barl
T Durfee
T Hayashi
T Hothorn
T Hothorn
T Kostic
T Wirth
Thomas Dandekar
TJ Gentry
TJ Johnson
Tobias Müller
Torben Friedrich
U Dobrindt
U Dobrindt
Ulrich Dobrindt
W Chen
W Deng
W Deng
W Han
W Huber
WF Fricke
Wilfried Weigel
Wolfgang Rabsch
X Wang
X Wang
Y Song
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Epigenetic dynamics of monocyte-to-macrophage differentiation

Author: AL Sullivan
Andreas S. Richter
BE Bernstein
Benedikt Brors
Bernhard Horsthemke
Bärbel Felder
Christopher Schröder
Claudia Haak
CM Rivera
Corinna Attenberger
CY McLean
D Gosselin
D Kim
D Kim
D Warde-Farley
DA Hume
DA Hume
Daniela Beißer
E Arner
Elsa Leitão
F Ramírez
Fabian Müller
Filippos Klironomos
GE Tusnady
Gerd Schmitz
Gideon Zipprich
Gilles Gasparoni
H Li
J Beygo
J Beygo
J Ecker
J Ecker
J Stöhr
JK Alder
JR Dixon
Jörn Walter
K Adelman
K Rademacher
Karl Nordström
KD Hansen
KD Pruitt
KS Zaret
L Arrigoni
L Burger
L Escoubet-Lozach
L-C Li
Laura Arrigoni
Ludger Klein-Hitpass
M Gardiner-Garden
M Klug
M Klug
Matthias Barann
MI Love
MJ Ziller
MR Friedländer
Nikolaus Rajewsky
P Sood
Peter Ebert
Philip Rosenstiel
R Andersson
R Vento-Tormo
RE Harrison
RJ Mayoral
S Rahmann
S Saeed
Sebastian Fröhler
Stefan Wallner
Sven Rahmann
T Arányi
Tea Berulava
Thomas Lengauer
Thomas Manke
TK Kelly
Ulrike Bönisch
V Matys
VC Jiménez
W Xu
WA Pastor
Wei Chen
X Liao
X Zhang
Y Liu
Z Cao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Computational pan-genomics: Status, promises and challenges

Author: Abeel T. (Thomas)
Alkan C. (Can)
Baaijens J.A. (Jasmijn)
Bakker P.I.W. (Paul) de
Boeva V. (Valentina)
Bonnal R.J.P. (Raoul)
Chiaromonte F. (Francesca)
Chikhi R. (Rayan)
Ciccarelli F.D. (Francesca)
Cijvat C.P. (Robin)
Datema E. (Erwin)
Dijkstra L.J. (Louis)
Duijn C.M. (Cornelia) van
Dutilh B.E. (Bas)
Eichler E.E. (Evan)
El-Kebir M. (Mohammed)
Ernst C. (Corinna)
Eskin E. (Eleazar)
Garrison E. (Erik)
Ghaffaari A. (Ali)
Guryev V. (Victor)
Kersey P. (Paul)
Klau G.W. (Gunnar)
Kloosterman W.P. (Wigard)
Korbel J.O. (Jan)
Lameijer E.-W. (Eric-Wubbo)
Langmead B. (Benjamin)
Marschall T. (Tobias)
Martin M. (Marcel)
Marz M. (Manja)
Medvedev P. (Paul)
Mu J.C. (John)
Mäkinen V. (Veli)
Neerincx P.B.T. (Pieter)
Novak A.M. (Adam)
Ouwens K. (Klaasjan)
Paten B. (Benedict)
Peterlongo P. (Pierre)
Pisanti N. (Nadia)
Porubsky D. (David)
Rahmann S. (Sven)
Raphael B.J. (Benjamin)
Reinert K. (Knut)
Ridder D. (Dick) de
Ridder J. (Jeroen) de
Rivals E. (Eric)
Sanders A.D. (Ashley)
Schlesner M. (Matthias)
Schulz-Trieglaff O. (Ole)
Schönhuth A. (Alexander)
Sheikhizadeh S. (Siavash)
Shneider C. (Carl)
Smit S. (Sandra)
The Computational Pan-Genomics Consortium
Valenzuela D. (Daniel)
Vandin F. (Fabio)
Wang J. (Jiayin)
Wessels L.F.A. (Lodewyk)
Ye K. (Kai)
Zhang Y. (Ying)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018
Field of study

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different Computational methods and paradigms are needed.We will witness the rapid extension of Computational pan-genomics, a new sub-area of research in Computational biology. In this article, we generalize existing definitions and understand a pangenome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a Computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations

CWI's Institutional Repository

Erasmus University Digital Repository

Massively parallel read mapping on GPUs with the q-group index and PEANUT

Author: Köster J. (Johannes)
Rahmann S. (Sven)
Publication venue: 'PeerJ'
Publication date: 01/09/2014
Field of study

We present the q-group index, a novel data structure for read mapping tailored towards graphics processing units (GPUs) with a small memory footprint and efficient parallel algorithms for querying and building. On top of the q-group index we introduce PEANUT, a highly parallel GPU-based read mapper. PEANUT provides the possibility to output both the best hits or all hits of a read. Our benchmarks show that PEANUT outperforms other state-of-the-art read mappers in terms of speed while maintaining or slightly increasing precision, recall and sensitivity

CWI's Institutional Repository

Directory of Open Access Journals

PubMed Central